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Abstract — This paper focuses on a particular transmission 
scheme called local network coding, which has been reported 
to provide significant performance gains in practical wireless 
networks. The performance of this scheme strongly depends 
on the network topology and thus on the locations of the 
wireless nodes. Also, it has been shown previously that finding 
the encoding strategy, which achieves maximum performance, 
requires complex calculations to be undertaken by the wireless 
node in real-time. 

Both deterministic and random point pattern are explored and 
using the Boolean connectivity model we provide upper bounds 
for the maximum coding number, i.e., the number of packets 
that can be combined such that the corresponding receivers are 
able to decode. For the models studied, this upper bound is of 
order of >/N 9 where N denotes the (mean) number of neighbors. 
Moreover, achievable coding numbers are provided for grid- 
like networks. We also calculate the multiplicative constants that 
determine the gain in case of a small network. Building on the 
above results, we provide an analytic expression for the upper 
bound of the efficiency of local network coding. The conveyed 
message is that it is favorable to reduce computational complexity 
by relying only on small encoding numbers since the resulting 
expected throughput loss is negligible. 

Index Terms — encoding number, network coding, random net- 
works, wireless 

I. Introduction 

Network coding is an exciting new technique promising 
to improve the limits of transferring information in wireless 
networks. The basic idea is to combine packets that travel 
along similar paths in order to achieve the multicast capacity 
of networks. The network coding scheme called local network 
coding was one of the first practical implementations able to 
showcase throughput benefits, see COPE in flTTl . The idea 
of local network coding is to encode packets belonging to 
different flows whenever it is possible for these packets to be 
decoded at the next hop. The simplicity of the idea gave hopes 
for its efficient application in a real-world wireless router. In 
the simple Alice-relay-Bob scenario, the relay XORs outgoing 
packets while Alice and Bob use their own packets as keys for 
decoding. The whole procedure offers a throughput improve- 
ment of 4/3 by eliminating one unnecessary transmission. 

Local network coding has been enhanced with the func- 
tionality of opportunistic listen. The wireless terminals are 
exposed to information traversing the channel, and ifTTIl pro- 
posed a smart way to make the best of this inherent broadcast 
property of the wireless channel. Particularly, each terminal 
operates in always-on mode, overhearing constantly the chan- 
nel and storing all overheard packets. The reception of these 
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packets is explicitly announced to an intermediate node, called 
the relay, which makes the encoding decisions. Finally, the 
relay can arbitrarily combine packets of different flows as long 
as the recipients have the necessary keys for decoding. Using 
the idea of opportunistic listen, an infinite wheel topology, 
where everyone listens to everyone except from the intended 
receiver, can benefit by an order of 2 in aggregate throughput 
by diminishing the downlink into a single transmission, see 
l4li . The wheel is a particular symmetric topology that is 
expected to appear rarely in real settings. Also, the above 
calculations take into account that all links have the same 
transmission rates, thus it takes the same amount of time to 
deliver a native (non-coded) packet or an encoded one. In 
addition, all possible flows are conveniently assumed to exist. 
This, however, is not expected to be a frequent setting in a real 
world network. A natural question reads: what is the expected 
throughput gain in an arbitrary wireless ad hoc network? 

The maximum gain does not come at no cost either. De- 
ciding which packets to group together in an encoded packet 
is not a trivial matter as explained in (4), in ER ifTTl and 
in CLONE lfT6l . In the latter case, the medium is assumed 
to be lossy, and the goal is to find the optimal pattern of 
retransmissions in order to maximize throughput. In the first 
case, a queue-length based algorithm is proposed for end-to- 
end encoding of symmetric flows (i.e. flows that one's sender 
is the other's destination and the other way around.). All these 
decision-making problems are formulated as follows. Denote 
N(fi) the set of nodes in need of a packet belonging to flow 
fi and H(fi) the set of nodes having it. Then the encoded 
combination of two packets belonging to flows fi and fj can 
be decoded successfully if and only if N(fi) C H(fj) and 
N(fj) C H(fi). If this condition is true we draw an edge on 
the coding graph with vertices all the possible packets. Then 
finding the optimal encoding scheme is reduced to finding a 
minimum clique partition of the coding graph, a commonly 
known NP-hard problem, IfTTl . Moreover, the same complexity 
appears when the relay node makes scheduling decisions, i.e., 
selecting which packets to serve and with what combinations. 
Work related to index coding has shown that this problem 
can be reduced to the boolean satifyability problem (SAT 
problem), l5l . 

Thus a second question arises: what is the loss in throughput 
gain if instead of searching over all possible encoded packet 
combinations, we restrict our search in combinations of size at 
most ml In this paper we are interested in showing that, for a 
real ad hoc wireless network, opportunities for large encoding 
combinations rarely appear. To show this, we consider regular 
topologies like grids as well as random ones. We calculate 
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the maximum encoding number in these scenarios in the 
mean sense and we consider small as well as large networks. 
To capture the behaviour of large (or dense) networks, we 
examine the scaling laws of maximum encoding number. 

Scaling laws are of extreme interest for the network com- 
munity in general. Although they hold asymptotically, they 
provide valuable insights to the system designers. In this 
direction, the authors in fT8l study the wireless networks 
scaling capacity in a Gupta-Kumar way taking into account 
complex field NC. ifTol also examines the use of NC for 
scaling capacity of wireless networks. They find that NC 
cannot improve the order of throughput, i.e. the O {^=^j 
law prevails. Q discusses the issue of scaling NC gain in 
terms of delays while |12] identifies the energy benefits of 
NC both for single multicast session as well as for multiple 
unicast sessions. In [8], NC is used instead of power control 
and the benefits are characterized. In a similar spirit, fT3l 
investigates the use of rate adaptation for wireless networks 
with intersession NC. Utilizing rate adaptation, it is possible to 
change the connectivity and increase or decrease the number 
of neighbors per node. They identify domains of throughput 
benefits for such case. 

The most relevant work in the field is |9|. The authors 
analyze the maximum coding number, i.e., the maximum 
number of packets that can be encoded together such that the 
corresponding receivers are able to decode. They show that this 
number scales with the ratio j where S is a region outside the 
communication region and inside the interference region. Note 
however, that this work does not yield any geometric property 
for the frequency of large combinations since it relies only 
on specific protocol characteristics. In networks with small 5, 
e.g., whenever a hard decoding rule is applied, there is no 
bound for the maximum coding number. 

In this paper we study the problem from a totally different 
point of view, showing that there exist inherent geometric 
properties bounding the maximum coding number below a 
number relative to the population or density of nodes. More- 
over, we apply the Boolean connectivity model for which 
6 = 0, and thus the previous result does not provide any bound 
at all. 

We show that the upper bound of the maximum coding 
number is related to a convexity property that any valid combi- 
nation has. We start by considering a fixed separation distance 
network, like a square grid, and show that in such networks, 
the maximum coding number is 0(VN) and ^(a/7V)Q where 
N is the number of nodes in transmission range of the relay. 
This implies that, even in networks with canonically placed 
nodes, the maximum coding combination is line- shaped even 
though the set of all nodes live in 2-dimensions. Next we study 
a random network where the locations of the nodes follow a 
Poisson point process on the plane. In this case, the maximum 
encoding number is found to be bounded in probability by 
0(A2+ e ), where A is the node density and e > arbitrary. 
Finally we consider the case where the encoder searches for 



1 The symbol 0() denotes that the function is bounded above by some 
linear function of the expression in the brackets whereas f2() denotes that the 
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Fig. 1. The set of valid nodes V is selected inside the disk of radius R and 
origin the location of vq. 



combinations of at most size m < N. We show that the 
throughput efficiency loss in this case depends on the size of 
the network, and for small networks the loss can be negligible. 
This way we motivate heuristic algorithms that avoid the high 
complexity arising in encoding selection. Through extensive 
simulations we show that all the derived results hold in general 
even for small networks. 

The paper is organized as follows. In Section [TTJ the model 
is described and some basic properties are given. In Section 
[HT| the main results for the case of grid-like networks are 



derived. Then in Section IV the case of randomly positioned 
networks is considered. A rate analysis is provided in Section 
|V| and simulation results are shown in Section VI The paper 
is concluded in Section IVIII 



expression is bounded from below. 



II. Communication model 

We assume a set of nodes V, positioned on the plane. 
Communications between these nodes are established via the 
Boolean interference model (see, e.g., (7)). In this model, 
a link between two nodes {vi,vj} is realized if and only 
if \K(vi) — "K(vj)\ < R. In this case, we say that V{ is 
connected with Vj and vice versa. Note that the Boolean 
interference model is an undirected graph in the sense that 
only bi-directional links appear. 

A. Information flow 

Each node vi having degree deg (vi) > 1, apart from 
transmitting and receiving, relays information. In this process, 
it is possible to avoid unnecessary transmissions by employ- 
ing local network coding. To simplify the analysis, we will 
consider only one cell, i.e., we will focus on a given node vq, 
and all its neighbors, and calculate the network coding gain 
on the downlink of this node. A similar result, then, holds 
for any such node serving as a relay. Thus we restrict V to 
contain all neighbors of vq, with V = {^1,^2, . . . , vn} and 
N = \V\ the number of nodes under consideration. For a 
network determined by a Poisson point process with density 
A, we use correspondingly the mean number of points which 
is given by E[N] = XirR 2 . The main objective of this paper 
is to find how the maximum coding number and the maximum 
network coding gain scale with the number of nodes. Also we 
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will provide bounds for the scaling constants which are useful 
for determining the behaviour in small networks. 

Apart from the number of neighbors, the gain analysis 
depends also on the activated flows. In the simple Alice- 
relay-Bob topology, it is possible that only the flow going 
from Alice to Bob is activated, in which case the gain is 
zero. In this paper we are interested in determining an upper 
bound for the efficiency loss when the relay is constrained 
on combinations of size m < N (e.g. if m = 2 the system is 
constrained to pairwise XORing). For this reason, we consider 
the maximum gain scenario. For each node designated as a 
relay, we assume that all possible two-hop flows traversing 
this relay are activated. This means that each node designated 
as a relay, has all possible different packets from which to 
select an XOR combination to send to the neighbors. Since 
not all of those combinations are valid, finding the maximum 
valid combination that corresponds to the maximum coding 
number is a non-trivial task and will be the goal of this paper. 
The resulting bound will help characterize the efficiency loss 
due to resorting to m-wise encoding. In real systems, some 
flows might not be active in which case the resulting efficiency 
loss from m-wise encoding will be even smaller. 

To make this more precise, similar to [17], we define source- 
destination pairs designating 2-hop flows that cross the relay. 
Each flow / G T has a source S(f), a destination D(f), 
a set of nodes having it H(f) C V (either by overhearing 
or ownership) and a set of nodes needing it N(f) C V. We 
write C because at least one node, the destination D(f) or the 
source S(f), is not part of H(f) and N(f) correspondingly. 
Two flows fi , fi are called symmetric when they satisfy the 
property S(fr) = D(f 2 ) and D(fi) = S(f 2 ). 

B. Constraints 

Here we summarize the previous subsection in the form of 
constraints. We will focus on network coding opportunities 
appearing in the aforementioned arbitrary network around the 
relay vq. 

Definition 1. (valid node): A node Vi G V is a valid node if 
\X(v t )-X(v )\<R 

Definition 2. (valid flow): A flow f G T is a valid flow if 
S(f) and D(f) are valid nodes not neighboring with each 
other, i.e., |X(S(/)) - X(Z>(/))| > R. 

Definition 3. (valid combination): A subet of flows C C T with 
C = {/i, • • • •> fc}> where C = \C\, is a valid combination 
if 

• each flow fi G C is a valid flow, 

• every pair of flows fi,fj G C, fi ^ fj satisfies N(fi) C 
H(fj) and N(fj) C H(fi) or equivalently, S(f) is 
connected with D(fj) while S(fj) is connected with 
D(fi). 

We define the maximum coding number C max as the 
greatest cardinality among all valid sets C. If the positions of 
the network are random, C max is evidently a random variable. 

Note that we could impose additional constraints. For ex- 
ample, if a flow can be routed more efficiently by a node other 



than vo, then this flow should be excluded from the set of valid 
flows. This would restrict further the set of valid combinations 
and thus by omitting this constraint we derive an upper bound 

for C max . 

Next, we state some fundamental properties of the valid 
combinations. For each flow / belonging to a valid combina- 
tion C we have 

. £>(/) e N(f), 

. £>(/) e H(j) for all jeC\ {/}, 
which leads us to the following properties. 

Remark 1. The destination node of fi G C is different from 
the destination node of any other flow fj G C \ {fi}. 

Remark 2. The source node of f G C is different from the 
source node of any other flow fj G C\ {fi}. 

Next, we provide a result on the topology for a valid 
combination. Let Xq represent the set of locations of all 
nodes being the source or destination of a flow belonging to 
a combination C. 

Lemma 1. Any valid combination of size 3 or larger corre- 
sponds to a convex polygon ( the polygon is formed using the 
set Xq as edges). 

Proof: Consider a valid combination defined by flows 

c = {/;,; = i,...,c}, 

where C > 3. Consider also the set of nodes that are sources 
and/or destinations in C 

V c = UiS(fi) U, D{fi) 

and the induced set of locations Xq such that we have a 
bijective mapping for each element Vj G Vc with an element 

X(vj) e X c . 

Assume that there is a node Vj G Vc which is an interior 
point of the convex hulj^Jof Xq. Thus its location Xj = X(^j) 
can be written as Xj — J2i^j a i^i where J2i^j a i = 1 anc * 
ot-i > for all i. 

On the other hand, there is a unique Vj* G Vc, which is the 
communicating pair (source or destination) of Vj in at least 
one flow, so that 

\X j -X r \>R. (1) 

All the other nodes (destinations or sources) in Vc should 
be able to reach the node Vj* directly. Thus, 

\ x j ~ x r\ < ^2^\Xi -Xj* \ <^aiR< R, 

which is a contradiction to ([T]). Consequently the node Vj, as 
well as all other nodes of the combination, necessarily lie on 
the perimeter of the convex hull. Thus, the nodes of a valid 
combination are the vertices of a convex polygon. ■ 
When the set of sources is identical to the set of destinations, 
the combination consists of symmetric flows only and C = 
2(|V'| - 1), V'CV. 

2 The convex hull of points X is the minimal convex set containing X. 
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In order to calculate an upper bound of the network coding 
combination size, it is enough to resort to the case of sym- 
metric flows. 

Lemma 2. For any valid combination there exists at least 
one combination of the same or larger size that contains only 
symmetric flows. 

Proof: We will show that for any flow we can add the 
symmetric one without invalidating the combination as long 
as it is not already counted. 

In a bipartite graph with all the nodes V on one side and 
the destinations of C on the other, consider a directional link 
if, between the source of flow / and its destination, for each 
/ G C. Note now that the nodes having out-degree one, i.e., 
the active sources in C, may or may not be identical to one 
of the destination nodes. We can make a partition of the set 
of active sources by assigning those with the above property 
to the set T sy m and the rest to the complementary set T sym - 
If Tsym = 0, then the Lemma is proved since C is a valid 
combination with symmetric flows only. If not, then we can 
create a new combination C which has more flows than the 
original one using the following process. For each transmitter 
in T S ym, say S(fi) the transmitter of flow fi, add one extra 
flow f{ with S(fl) = D(fi) and D(f!) = This flow 

does not belong to C (because S(fi) G Tsym) and it does not 
invalidate the combination due to the bidirectional properties 
of the model. Note that f[ is a valid flow because S(f-) cannot 
be connected to D{f' i ) due to validity of fi. Note also that 
S(f[) is connected to D(f) for all / G C since this is again 
required for the decoding of the original flows. Thus, for any 
flow we can add the symmetric one without invalidating the 
combination. ■ 

Remark 3. If a valid combination consists of symmetric flows 
only, its size must be even. 

In graph theory terms, a valid combination with symmetric 
flows can be thought of as a graph created by a clique of C + 1 
nodes, minus a matching with ^ edges, with all symmetric 
flows defined by this matching activated. This graph is called 
in ifTTIl wheel topology. 

III. Analysis in grid-like topologies 

In this section we focus on positioning the nodes on a grid. 
Grid topologies often offer an insightful first step approach 
towards the random positioning behaviour. Also, the investi- 
gation of grids answers the question whether it is possible to 
achieve high NC gain by arranging the locations of the nodes. 

We therefore assume a network with the additional property 
|X(i>i) — X(i>j)| > d, for any pair of nodes Vi,Vj G V. This 
condition pertains to regular grids such as the square, the 
triangular and the hexagonal grid as well as other grids with 
non-uniform geometry. We impose nevertheless the property 
that the node density is the same over all cells and thus the 
geometry should be somehow homogeneous. The number of 
nodes inside a disk or radius R will be N = O (('f ) 2 ) for 
these networks and the corresponding node density A grid = 

om 2 )- 



Theorem 1. (Upper bound) The maximum coding number in 
fixed- separation networks is O (^/N^J where N is the number 

of nodes or equivalently O 

Proof: From Lemma [T] we know that the nodes belonging 
to the maximum combination form a convex polygon. Any 
such polygon fitting inside the disk of radius R must have 
perimeter smaller than 2ttR. Since the nodes on the perimeter 
should be at least d away from each other, we conclude that 
the maximum coding number is 

2nR 



C n 



< 



d 



This combined with N — O ("f 2 ) or respectively A grid = 
O {(\) 2 ) yields the result. ■ 
A particular case of the above bound is the square grid. 
The number of nodes inside the disk is = 7r (^) + e (^) 
where e (^) < 2^/2tt^ is an error decreasing linearly with 
d. Thus we obtain an upper bound 



CZT < vi^v. 

So far we have shown that any network with fixed sep- 
aration distance d and uniform density, will have maximum 
coding number of 0(y/~N), where N is the number of nodes 
connected to the relay. In particular, the constant can be 
determined for any given grid and for the square grid is 
2 v / / 7r. The simulations show that the actual maximum encoding 
number is approximately half of that calculated above. The 
reason for that is basically that the valid polygon is always 
smaller than the disk of radius R and often close to the size 
of a disk of radius ^. 

It is interesting to bound the achievable maximum coding 
number from below as well. To obtain intuition about this 
bound we start with a non-homogeneous topology, the cyclic 
grid. We construct concentric cyclic groups of radius Ri = id, 
i = 0,1,..., L^J that fall inside the disk of radius R. Each 
cyclic group has as many nodes as possible such that the fixed 
separation distance condition is not violated. Such a topology 
exhibits different behavior depending on the selected origin (it 
is not homogeneous), nevertheless it helps identify a particular 
behavior of the achievable maximum coding number. The 

cyclic group at Ri has 

of radius R will have 



nodes. Thus the grid 
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A very good approximation is, 
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Fig. 2. Sketch for the proof of Theorem [2] 



Theorem 2. (Lower bound) In networks with nodes d away 
from each other and cell radius R, an achievable maximum 
coding number is 

1) CS^f = Sl(y/N) for cyclic grid with rem (R, 2d) -> 0, 



2) C\ 



3) 



cyclic 
max - 

and 

^square 



f2(viV) /or cyclic gn'd wzY/z rem (i?, 2d) — » d 



f2( yiV) for the square grid of d, 



where rem(x,y) is the remainder of the division x/y. 



Proof: 

For the cyclic grid when rem (R,2d) 



0: By focusing on 



the cyclic group with inf{z : Ri > 4|}, note that each node 
is 7^ + e from the center and thus the desired connectivity 
properties are satisfied for all nodes on the cyclic group. In 
this case, we can calculate the number of nodes in the group 
as 

(jcyclic ~ ' T 



arccos (l 



2d 2 " 
R 2 , 



which for large N is bounded from below by some linear 
function of y/~N. 

For the cyclic grid when rem (i?, 2d) — >• d: Now each node 
is y + d — e away from the center and thus we need to select 
those nodes satisfying the property of valid combination. For 
this it is enough that we leave an empty angle (j) such that if 
AOB is a diameter and AOC this angle, then CB < R. By 
solving this for the maximum number of points satisfying this 
property we get 



(jcyclic 



2tt 



arccos 



R 2 



1 



2^+df 

which for large N is bounded from below by some linear 
function of \/~N . 

For the square grid: we construct a ring around the circle 
of 7^ radius. The width of the ring is S wide enough to fit a 
whole squre of dimensions d x d. Such a square is bound to 
contain exactly one node of the grid. Using Figure [2j and the 
triangles relative to the small square, we calculate 6 as 

y/R 2 + d(bd + 4R)-R 



Thus we can show that d < 6 < ^y. If we use the largest 
possible value that guarantees that the ring contains one node 
at each step, namely 6 = l.bd, we can compute the angle that 
contains at least one node, which is of the order of d: 



(f)(5) = arcsin 



V2d 



Vd 2 + R 2 



Finally, we compute the angle which should be left empty in 
the valid combination such that any node outside this angle is 
reachable by the most distant node (the one at the bottom). 



u(S) = arcsin 



R 



>/2(ii+1.5d) 



This angle is of the order of Vd. Finally an achievable 
combination is obtained if we alternate and uj until we fill 
the circle. 

rysquare 

max " [4>(S) + lj{5)\ ' 

which is bounded from below by a linear function of a/TV. 
Note that the sparseness of the combination is due to u(S) 
and a possible reasoning is that the bound is constructed to 
cover all the cases, thus also the case that the uncomfortable 
positioning of nodes matches the second case of the cyclic 
grid above. ■ 
In O, relative results on convex polygons in constrained 
sets guarantee the existence of convex polygons of size 
Q(yfN) when the N thrown nodes are kept seperated by some 
distance. 

IV. Stochastic analysis 

Assume that the locations of the nodes are determined 
by a Poisson point process with density A. The connectivity 
properties of this model are well studied in the literature (see 
e.g. fT5lL 0). In our work, we assume that the network is 
percolating, i.e. the nodes are dense enough to ensure multihop 
communications. As in the deterministic case, we assume that 
a relay is located at the origin. For a Poisson point process 
this assumption does not change the distribution of the other 
points. The main result is an upper bound in probability for 
the maximum coding number. 

Theorem 3. In a random network determined by a Poisson 
point process with density A, the maximum coding number 
corresponding to combinations having the relay at the origin 
satisfies 



lim P 

A— )>oo 



(c max (A) = 0(A 1/2+e ) 



1, 



for any e > 0. 



Proof: Cover the disk of radius R around the origin by 
disjoint boxes of size ^= x A=. The number of nodes inside 
the boxes is denoted by Ni, i = l,...,n(A). The Ni are 
identically and independently Poisson(l) distributed and thus 
there is a sequence I n such that 



lim P ( max Ni 

,l<i<n 



h Or Irr 



1 



1, 
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where I n = O (^^) (see 0, El). Since n(A) = 0(A) 

log A 



lim PI max N{ 

A— )*oo \ i=l,...,n(A) 



o 



1. 



(3) 



v log log A 

Next consider a valid combination. By Lemma [T] the nodes 
of the combination form a convex polygon. The perimeter of 
any convex polygon is at most 2ttR because it is located inside 
the disk of radius R. Since at most 0(y/\) boxes of size 
^ x ^ are needed to cover the perimeter of any convex 
polygon, 



Cm a x(A) < 

This implies that 

P(c max (A) <0 



max NiO(VX) 

=l,...,n(A) 



a.s. 



>P( max Ni<0(g(\)) 

i=l,...,n(A) 



Setting g(X) = log A/ log log A, applying equation ^ and 
finally noticing that log A/ log log A = O (A e ) for any e > 
completes the proof. ■ 

V. Rate efficiency of a network coding 

COMBINATION 

We will focus on the downlink of a valid combination of 
size C. Without loss of generality, assume that the rate vector 
r = { r i}i=i,2,...,c is ordered, i.e., n < r 2 < • • • < rc, and 
that the flow set is permuted accordingly so that over the link 
{vo^D{fi)) packets are transfered at a rate r^. 

The data rate is computed as the number of packets of 
size P transmitted in a virtual frame over the time needed 
for these transmissions. Since an encoded packet is always 
transmitted at the lowest rate decodable by all receivers 
and assuming max-min fair allocation^] we can deduce the 
maximum throughput rate with network coding as 

rNc(C) C p Cn. 



i{r} 



The rate without NC would be 

r w (C) CP 



P_ 

n 



P_ 

r 2 



P_ 




rh, 



where is the harmonic mean of r. Choosing any m < C 
and allowing for combinations of size m at most, it is easy to 
see that if the criteria for valid combinations are fulfilled for 
combination of size C then they are fulfilled for all subsets. 
The corresponding achievable rate is 

CP 

r m (C) = - 



= C 



3 This condition of fairness provides the best network coding opportunities 
and it is usually the balance point where network coding gain is computed in 
multiclass networks. 






Fig. 3. Coding combination examples for C m ax = 6 and C n 
square grid with N = 81. 
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Fig. 4. Simulation of maximum coding number of a square grid inside a 
disk with iV nodes. C max is the upper bound and C m i n is the achievable 
lower bound provided in the theoretical analysis. 



Next, we derive the network coding gain for the maximum 
combination (C) and for the constrained group (m < C). 



9(C) 



^ r NC (C) 
r w (C) 



C 



rh 



Note that the gain is a linear function of C and depends on 
the particularities of the rate vector. Also, 



9m{C) 



. r m (C) 




where in the first inequality we have used that r± is the mini- 
mum rate, and in the second we have used m — 1 < r~r < m. 

If we choose equal rates, then we readily get g(C) = C and 
max {g m {C)} = m as the maximum gain for both. Finally we 
can symbolize that g(C) = 0(C) and moreover the difference 
g(C) — gm(C) = 0(C — m). Therefore, the efficiency loss 
is of the order of \/N — m which means that for carefully 
chosen m, the loss can be kept small. 
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Fig. 5. Maximum coding combination examples for C m ax = 6 and C m ax = 
8 in a uniformly thrown network with N = 21 and N = 70 respectively. 
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Fig. 6. Mean maximum coding number in a network of N uniformly thrown 
nodes. 

VI. Numerical results 

In this section we present some simulation results that pro- 
vide further evidence and insight for our work. For simulation 
purposes we consider a disk of radius R = 1 and a node vq 
serving as a relay situated at the center of the disk. Initially 
we consider a square grid of nodes over this disk and we 
investigate the maximum coding number, i.e., a set of nodes 
that satisfies the constraints of section [III Then, the scenario 
of uniformly random thrown nodes is considered. 

A. Experiments with square grids 

Figure [3] showcases examples of combination size C = 6 
and C = 8 (the maximum is 8 in this case). During these 
experiments we identified an interesting property. We noticed 
that the maximum coding number depends only on the number 
of nodes inside the disk and not on the actual d used. In 
particular, for all {d : N is fixed}, C max is constant. In Figure 
[4] we present the results for the square grid. The actual values 
seem to be closer to the lower bound than to the upper, leading 
to an order closer to \fN (notice the logarithmic scale of the 
figure). Nevertheless, the oscillating effect due to the interplay 
between the radius and the number of nodes is evident. 

B. Experiments with randomly positioned nodes 

Next, we throw N nodes uniformly inside the disk of 
radius R = 1. Examples of maximum coding number are 
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Fig. 7. Probability of existence of at least one coding combination of size 
C in a network of N uniformly thrown nodes. 

showcased in Figure [5] It is noted from these examples that 
large combinations tend to appear in a 5-xmg form where the 
inner side of the ring is a disk of radius ^ and the outer side 
is a disk of radius ^ + 5. 

In Figure [6j we present the mean maximum coding number, 
for different number of nodes N. In each sample, the maxi- 
mum coding number is calculated and the mean is obtained by 
averaging over 1000 random samples. The 0(y/N) behavior 
is depicted in this picture. 

Figure [7] shows the probability of existence of at least one 
coding combination of size C in a network of N uniformly 
thrown nodes. For example, the maximum component size 
for 20 < TV < 50 is either 4 or 6 in the majority of 
cases. The simulation results show that in real networks of 
moderate size the usual combination size is quite small. The 
multiplicative constant seems to be close to 1. In this context, 
the focus should be on developing efficient algorithms that 
opportunistically exploit local network coding over a wide 
span of topologies using small XOR combinations rather than 
attempting to solve complex combinatorial problems in order 
to find the best combinations available. 

C. A realistic scenario 

Next, we set up a realistic experiment to be run in simulation 
environment. A relay node is positioned at the origin, willing 
to forward any traffic required and apply NC if beneficial. 
Then, we throw y pairs of nodes randomly inside the disk 
defined by the relay and the communication distance R\ 
note that all nodes are valid nodes. Each pair constitutes a 
symmetric flow. Each flow may be valid or invalid depending 
on the distance between the two nodes, see definition in section 
|IT1 Whenever the flow is invalid, the nodes communicate 
directly by exchanging two packets over two slots (one for 
each direction). If the flow is valid, the relay is utilized to 
form a 2-hop linear network. Again, 2 packets are uploaded 
towards the relay using two slots while the downlink part is 
left for the end of the frame. Finally, the relay has collected 
a number of packets which may be combined in several ways 
using NC. To identify the minimum number of slots required 
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Fig. 8. A realistic scenario. Pairs of nodes forming flows are randomly 
thrown inside the disk (0, r). The relay node (situated at the origin) assists 
the flows that cannot communicate directly. We restrict NC combinations to 
size m where m = 2, 4, oo. 

to transmit those packets to the intended receivers, we solve 
the problem of minimum clique partition with the constraint 
of using cliques of size up to y (equivalently, combining up 
to m packets together). In the above we have assumed that all 
links have equal transmission rates and that the arrival rates 
of the flows are all equal (symmetric fair point of operation). 
The network coding gain is calculated by dividing the number 
of slots used without NC by the number of slots using NC. 

Figure [8] depicts results from simulated random experiments. 
Evidently, it is enough to combine up to two packets per 
time in order to enjoy approximately the maximum NC 
gain. This example supports the intuition that in practice the 
network coding gain from large combinations is expected to 
be negligible. 
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VII. Conclusion 

By considering the Boolean connectivity model and ap- 
plying the basic properties required for correct decoding, we 
showed that for the local network coding there are certain ge- 
ometric constraints bounding the maximum number of packets 
that can be encoded together. Particularly, due to the convexity 
of any valid combination, the sizes of combinations are at most 
order of for all studied network topologies. 

The fact that the number of packets is limited gives rise to 
approximate algorithms for local coding. Instead of attempting 
to solve the hard problem of calculating all possible cod- 
ing combinations, we showed that an algorithm considering 
smaller combinations does not lose too much. 
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